
Page 1 of 7



ENTERED 



1636 



RAW SEQUENCE LISTING DATE: 08/15/2000

PATENT APPLICATION: US/09/430,590A TIME: 11:58:10

Input Set : A:\ES.txt 

Output Set: N:\CRF3\08152000\I430590A.raw 



3 <110> APPLICANT: POULTER et al.

4 POULTER et al. 

6 <120> TITLE OF INVENTION: UNUSUAL RETROTRANSPOSON FROM THE YEAST CANDIDA ALBICANS
8 <130> FILE REFERENCE: 674521-2001.1 

10 <140> CURRENT APPLICATION NUMBER: 09/430,590A

11 <141> CURRENT FILING DATE: 1999-10-29 
13 <160> NUMBER OF SEQ ID NOS: 3
15 <170> SOFTWARE: PatentIn Ver. 2.1

17 <210> SEQ ID NO: 1 

18 <211> LENGTH: 6426 

19 <212> TYPE: DNA 

20 <213> ORGANISM: Candida albicans 
22 <400> SEQUENCE: 1

23 tgttggtttg tgcactattt tgtgtcagaa actgatcaat gaaaatgatg gttattatga 60

24 gaatggaaaa tttttccatc acacatcagg tgatgacaga actaaactat attgtgtagt 120 

25 ataaataagg gtatgaaata ccaacatccc agaatatcaa cgagatagaa gggaggagtt 180

26 tcaatatata tcttgtgaat aataacttcg ttctaattca ctatacacaa ctagacgtgt 240

27 acacgctcaa tctcaggtaa agaaagttta tattccatca gattagaagt cgatagtgat 300 

28 aatcatttcg tcccaaatta gcgttgtata aattcagtcc tcagatttgt attattgatt 360 

29 gatagtttcg aagtttgaag gtacagaatt tcacaagatg agttccgcaa agaatgatga 420 

30 taacgaaggg aaggtcatgg aaagtgttga tcaagctaat gctattagta aggtggatga 480 

31 acatatcaag gctagattca atatgctttt cataaaattt aatgacttac ctaagttggc 540

32 cgtcggtaat cagaaaagcg tggataaatg gaatgaagaa tttaaatatt tccacgttgc 600

33 tttatatgtt gatgatattc ttatggttgg aagttcacaa aaagttattg ataattttgt 660 

34 ttaccccgat gttttggaat ttttgcttga ctataatcct aaagataaat tcaaggttaa 720 

35 aaaggtagaa ggtatttatt ttactggttg gtgtttacaa atgtgtttac agtccatttt 780

36 tgataggttc agattgatca tgatttctaa gctaccaaag cacttgcaaa aggaagcaaa 840 

37 cttaatcaaa gctgcttatg atgctgttac taaatctaaa gattatacca ttactagtaa 900 

38 gatcttgctg aagtttgtaa acgttgaaca tgagttagtg gtttgctata accttccata 960 

39 tttgctgcag gtggaagaga aacttgagga aatactctac aacacttcaa acgttgtcga 1020 

40 tgagtatgtc cgtagtcttc caaatctcat aggtcaagtc ttgtacttca atcatgtgaa 1080

41 gaaatcagag gctttaagtt tgtttttgaa tattcatgcc tcatactact caaagtggat 1140 

42 tcaagctgac aatgatacat cagtactccc aagttgctct accatagctg aagaaatgtg 1200
43 tgatcatcct gattatgcta gattggttga cattccaagc aacaaatatg aacttaatct 1260

44 tattgttagt ttaccagcac cagagaaacc aaaaggaaaa ccagaggaga actcactgga 1320 

45 acaatctcaa aagaagaacc tgaaatcaag aaagagaaat aagaaacatc caaaatcaga 1380 

46 taacgataaa ggtgaaaaag aaaaagaaaa agaaaaaact tcactggaat gaaaaacagg 1440 

47 tgctgcttct attaattgtg taatgaatat acataattgc agcaaaacca cgtttccagt 1500 

48 agaaaattct cattctctta atgcttcttt gaacgtaatg aattttaaag gtttaaggtt 1560 

49 taacaagtat ctagtgtatg atactggtgc cacaatatct gttgtgaaca ataaagatat 1620 

50 attgctgaat gttaaggacg caacaattga agtttctgtt gctgatggtg ctacattaga 1680 

51 agcagattgt attggtgatc taattatcag agtcggtatt gtctcgatta cgttagagaa 1740 

52 tacattgtat ttaccagaaa gttcctttaa tcttgtgagt ttgaaacaaa ttgaagaacg 1800 

53 aggatttaat gttcttatta ctaaagaatc agtgattgta tttaaccaaa atgtggctcc 1860 

54 tactattatt gcttcaagga agaatgctgc tgatctttat atgggtcctc aattcagtga 1920 

55 agaatcttta gaatgtgatt ttgattatga tggtttggca gatatgttgt ccaatgctaa 1980 

56 ccaagatgac aaagataaat caagtatgaa tgaaatgtca gaatatcaag aacatgatta 2040 
57 tagttctcga gcattaataa attctttgac ggaggttgat gttttagatg ttgaaatttc 2100

58 cccatatgga gttgaacaat tgctaccaac tggagataag aacgatattt ataatttcca 2160 

59 tttgatgtca aatcatatgt ccattgagaa aatcttgttg ttacaaaaat accagggtct 2220 

60 cgtacttcac acttcaaaag agagtcttca aaagattgct gattgtaagg tatgtctatt 2280 

61 atcgaatgcc aaacagagaa gtcacaatca tcattcagaa agaaaagcct cgagaagaca 2340 

62 tgagagactt cattgtgata ctctcggtcc atttaggtcc gaaaataaca agtggtattt 2400 

63 aacgtctgtt atagatgaac atacgggtta cattgaagga attattacta aagacagaaa 2460 

64 ggtaaaggat ctcttaattc aacgattaaa gatctggaat aatcggttta acgataaggt 2520 

65 ggcatacttc agaagtgata atgctcctga gttcccacaa ccttctgatt tagctgagtt 2580 

66 cggtatttgg agggagacta tagcggcata tctgcctgag cttaatggtc tcgccgaggt 2640 

67 tgttaataaa ttgattttac aacagattta caggatcgtt gtgacacttg gtccacaaat 2700 

68 actcaagttg atttattatg tgattcaata ttctattaca atgatcaacc acactccacg 2760 

69 tcgttcactc aagggacaaa ccccttatgg ttgctattat caattaagtg agggaaattt 2820 

70 ctaccggttt ccttttgcca tcgattgtgt cgttacattt agtaatgcca tcgaaaagaa 2880 

71 ccgttacgga gttacatcaa ctaaaggagc tccttcatcg atcatgggtg ctgtgattgg 2940 

72 ctacgctagc gattgtttta gttattacgt gttgctaaaa aatatgcggt gtgatattat 3000 

73 ccttagccct aatgtccgta tattgcgaag ctatgaggtt attaactcct atctcaaaaa 3060 

74 cttatccact acacctatgt cacacattgt tcctatggct gaaggtatcc agggaaggca 3120 

75 actgggcgct cagtacgagg tacgcggaac atatgtggaa agtgaatatg acaatacaaa 3180 

76 tgacgtgatg cacatgccca aagagtcata ttcagttcag ccagcatcgt ttactttaac 3240 

77 tacgggtaac agttctaacg aatatgttat aaatgatgat ccagtacaga ttaccattga 3300

78 gaatcccgat gatttttcta accctcttca actaactgaa gaatcacacg atatggtatc 3360

79 cgaagtaaaa tcggatgaga atcctaaacc cagtctccac gagctaacac ctggggataa 3420

80 tccggtgtct aaacctcctc aacttggtac cgagacttca gtaataggga agtctaaaga 3480

81 gcctattaca aaccacacaa aggacgcccc ttccatccag gggagggacc ataaacgcct 3540 

82 ggaatctact gctcaggttg gactatcaca ccaaccccag actggtactc ccgcttcgga 3600 

83 ggagtcaaaa ttgtcaggaa cagatcattt cggtgtcgac gttgttaaag aaacagtctc 3660 

84 agaagattgg catacttctg actacccaga aactagtgct gaagatgaac agcaaaatcc 3720 

85 ctcgttactg gctaataaga atcgggtaac tgaaaaaata gatgagggag aaaatatttc 3780 

86 atttccgggg ggtgatgatg attctgtcgt gatcaactca aatgttgagc aatctaatgt 3840 

87 tgaaacagag gatgctggta acagtccaat tcaagacgaa gtttctcaag agggaagaat 3900

88 acttaatgaa caaactgata tagttgatac tgttgctaaa gttattgaga atgaaaaaat 3960 

89 ctctcctatt aattcattag atgatcatac tgaacttgct acagactcgg gaaatgatag 4020 

90 caattcaaca gaatccgaca ttcaatcgaa aaatgaaata tcaccagtga ttaatgagaa 4080 

91 aaatactgaa ataatccaaa aacacattga aagtatcctt gctgataaga gattggatga 4140

92 atttgaaacg tataatgttg atgaaattga gaatgtgatt aatgacgatg acattgctga 4200 

93 agctaatcca ctaccagatg aaaataatga tgttcagatg aatgagagtt ttgataataa 4260 

94 tcatagcatg tcacgagcaa agaagaaata cacatttgag aaagaagtta acgaaaaaat 4320 

95 tgctggtact aaacattcac ttgatacaac tgatccaaga gaagcaatca gagtgttaaa 4380 

96 tactggtgaa accaagagaa tcgaacccaa gaaaagagag gtgcctatca ctgtgaaatt 4440 

97 aaacaaaaga tcgcaataca agtcaccata tgttacaaga agtggtagaa cggttataaa 4500 

98 ccccaagagg tatttacatg cggtcgtcaa caaaatcgac tataatgatc cgggatggat 4560 

99 aaagtcaatg aatgctgaac tagagaaatt tagatcaaaa gatgtttacg aagaagttcc 4620

100 aattcccacc ggtgtgaagc ctatatctat gggttgggta catactgaga aaattgattc 4680

101 tctcaaaggt gttgttcgga aatcacgttg tgttgtccat ggcaacagac aaaaggaaaa 4740

102 attggattat gaccctttta gtgttagttc acctgttata gatcttgtga ctataagatt 4800 

103 attgacaata ataggttgtg aattaggaat gacaattcaa catttagacg tcgagtcggc 4860

104 gtatctaaat gcctctatta ctcattcaaa tccaatttat gtctttcctc ctaaatcagt 4920 

105 acctttgaag aaaaaccatt gttggttatt gaaacgttct gtctatgggt taaaacagtc 4980
106 gggtttggaa tggtatcaca ctatcaaaag agtattggaa gacattggtt ttactcaagt 5040

107 tttatatgtt gatgatattc ttatggttgg aagttcacaa aaagttattg ataattttgt 5100 

108 ggatcaattg agagatcatt ttgaagttaa agtgtttggt gaaatatcaa attatcttgg 5160 

109 tattgaattt cgtaaaaccg aatctggtta tattttatct caagaaaaat ttctcaagaa 5220 

110 attacttaag gatttcaaac tagatgactc atatgggaaa aacataccct ggattccgaa 5280 

111 tgacaaatat gaaaaggttg caataattcg tgaaaacgtt aatccagaga atgattttga 5340

112 aaaggttccg aatgagacat tgcttgaccc tgatgctaaa aaactatacc aaagtggtgt 5400 

113 tggcctgctt ttatgggctg ccacaaacac acgtccagat atatcggtcg tagtgaattc 5460 

114 gttgggttct aaatctgcaa atccaaatgt ccatgattat gagaaattga tttattgtct 5520 

115 taggtatatc aaaaatagca tgggatatca cattgagtac aaaagaaaca gattgaatat 5580 

116 accaccaaaa tcatttgtta tcgaatgttt cagtgatgcg tcatttgcac caggattgga 5640 

117 tagaaaatct attagtggaa ctttgattta tgtgaatgga aatttggtgc aatgggcgac 5700 

118 caaaaaacaa acggtcatag cacaaagctc agcagcttgt gaaatgttgg ctctaaatta 5760 

119 tacaatgttg aaagctatcg aaataaaaaa ccatttaatg gatttgggtt ttgaagtagg 5820 

120 taagatacat tgtcatcaag acaaccaagc tgtgattaaa gttttgagaa ataactattg 5880 

121 tcacccacat cgaccaatag atatctgcta taagtttcta cgccaattga tcaatgataa 5940 

122 agtattttca atatcctatg tgaagacaaa tgataattac gccgattgta tgactaagtg 6000 

123 tctaagtcgt gctaaattca aagcattcgt tgagggtatg ataaaacggt tagacctaga 6060 

124 agataatcaa acactgatac aaaatgcaat aacggcagaa taagtggatt tatcattact 6120 

125 attatcgtaa tgctcaatca ggggagtgtt ggtttgtgca ctattttgtg tcagaaactg 6180 

126 atcaatgaaa atgatggtta ttatgagaat ggaaaatttt tccatcacac atcaggtgat 6240 

127 gacagaacta aactatattg tgtagtataa ataagggtat gaaataccaa catcccagaa 6300

128 tatcaacgag atagaaggga ggagtttcaa tatatatctt gtgaataata acttcgttct 6360

129 aattcactat acacaactag acgtgtacac gctcaatctc aggtaaagaa agtttatatt 6420 

130 ccatca 6426

133 <210> SEQ ID NO: 2 

134 <211> LENGTH: 324 

135 <212> TYPE: PRT 

136 <213> ORGANISM: Candida albicans 
138 <400> SEQUENCE: 2
167 145 150 155 160

169 Val Val Cys Tyr Asn Leu Pro Tyr Leu Ser Gln Val Glu Glu Lys Leu

170 165 170 175 

172 Glu Glu Ile Leu Tyr Asn Thr Ser Asn Val Val Asp Glu Tyr Val Arg

173 180 185 190 

175 Ser Leu Pro Asn Leu Ile Gly Gln Val Leu Tyr Phe Asn His Val Lys
176 195 200 205

178 Lys Ser Glu Ala Leu Ser Leu Phe Leu Asn Ile His Ala Ser Tyr Tyr

179 210 215 220 

181 Ser Lys Trp Ile Gln Ala Asp Asn Asp Thr Ser Val Leu Pro Ser Cys

182 225 230 235 240 

184 Ser Thr Ile Ala Glu Glu Met Cys Asp His Pro Asp Tyr Ala Arg Leu

185 245 250 255 

187 Val Asp Ile Pro Ser Asn Lys Tyr Glu Leu Asn Leu Ile Val Ser Leu

188 260 265 270 

190 Pro Ala Pro Glu Lys Pro Lys Gly Lys Pro Glu Glu Asn Ser Ser Glu 

191 275 280 285 

193 Gln Ser Gln Lys Lys Asn Ser Lys Ser Arg Lys Arg Asn Lys Lys His

194 290 295 300 

196 Pro Lys Ser Asp Asn Asp Lys Gly Glu Lys Glu Lys Glu Lys Glu Lys

197 305 310 315 320
199 Thr Ser Ser Glu 

203 <210> SEQ ID NO: 3 

204 <211> LENGTH: 1576 

205 <212> TYPE: PRT 

206 <213> ORGANISM: Candida albicans 

208 <400> SEQUENCE: 3

209 Lys Thr Gly Ala Ala Ser Ile Asn Cys Val Met Asn Ile His Asn Cys

210 1 5 10 15

212 Ser Lys Thr Thr Phe Pro Val Glu Asn Ser His Ser Leu Asn Ala Ser 

213 20 25 30 

215 Leu Asn Val Met Asn Phe Lys Gly Leu Arg Phe Asn Lys Tyr Leu Val 

216 35 40 45 

218 Tyr Asp Thr Gly Ala Thr Ile Ser Val Val Asn Asn Lys Asp Ile Leu

219 50 55 60 

221 Ser Asn Val Lys Asp Ala Thr Ile Glu Val Ser Val Ala Asp Gly Ala

222 65 70 75 80 

224 Thr Leu Glu Ala Asp Cys Ile Gly Asp Leu Ile Ile Arg Val Gly Ile

225 85 90 95 

227 Val Ser Ile Thr Leu Glu Asn Thr Leu Tyr Leu Pro Glu Ser Ser Phe

228 100 105 110

230 Asn Leu Val Ser Leu Lys Gln Ile Glu Glu Arg Gly Phe Asn Val Leu

231 115 120 125 

233 Ile Thr Lys Glu Ser Val Ile Val Phe Asn Gln Asn Val Ala Pro Thr

234 130 135 140 

236 Ile Ile Ala Ser Arg Lys Asn Ala Ala Asp Leu Tyr Met Gly Pro Gln

237 145 150 155 160 

239 Phe Ser Glu Glu Ser Leu Glu Cys Asp Phe Asp Tyr Asp Gly Leu Ala 

240 165 170 175 